AITopics | linear function


linear function

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Weighted universal approximation of differentiable maps on infinite-dimensional manifolds

Schmocker, Philipp, Teichmann, Josef

arXiv.org Machine Learning, Jun-30-2026

We generalize the universal approximation theorem for functional input neural networks (FNN) to differentiable maps by including the approximation of the derivatives. An FNN maps the input from a possibly infinite-dimensional weighted manifold to the real-valued hidden layer, on which a non-linear scalar activation function is applied, and then returns the output into a Banach space via some linear readouts. By proving a weighted Nachbin theorem, we establish a universal approximation theorem for differentiable maps, which goes beyond the usual formulation on compact sets and also includes the approximation of the derivatives. This leads us to approximation results for non-anticipative functionals including the horizontal and vertical derivatives. As a further application, we show that linear functions of the signature are able to approximate path space functionals including their directional derivatives.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Machine Learning

2606.0982

Country:

North America > United States (1.00)
Europe > United Kingdom > England (0.27)

Genre:

Instructional Material (0.45)
Research Report (0.40)

Industry: Banking & Finance (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)


Regional Explanations: Bridging Local and Global Variable Importance

Neural Information Processing Systems, Jun-22-2026, 23:21:57 GMT

We analyze two widely used local attribution methods, Local Shapley Values and LIME, which aim to quantify the contribution of a feature value $x_i$ to a specific prediction $f(x_1, \ldots, x_p)$. Despite their widespread use, we identify fundamental limitations in their ability to reliably detect locally important features, even under ideal conditions with exact computations and independent features. We argue that a sound local attribution method should not assign importance to features that neither influence the model output (e.g., features with zero coefficients in a linear model) nor exhibit statistical dependence with functionality-relevant features. We demonstrate that both Local SV and LIME violate this fundamental principle. To address this, we propose R-LOCO (Regional Leave Out COvariates), which bridges the gap between local and global explanations and provides more accurate attributions.

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)


GIST: Greedy Independent Set Thresholding for Max-Min Diversification with Submodular Utility

Neural Information Processing Systems, Jun-16-2026, 19:42:22 GMT

This work studies a novel subset selection problem called max-min diversification with monotone submodular utility (MDMS), which has a wide range of applications in machine learning, e.g., data sampling and feature selection. Given a set of points in a metric space, the goal of MDMS is to maximize $f(S) = g(S) + \lambda \cdot \mathrm{div}(S)$ subject to a cardinality constraint $|S| \le k$, where $g(S)$ is a monotone submodular function and $\mathrm{div}(S) = \min_{u,v \in S : u \ne v} \mathrm{dist}(u,v)$ is the max-min diversity objective. We propose the GIST algorithm, which gives a 1/2-approximation guarantee for MDMS by approximating a series of maximum independent set problems with a bicriteria greedy algorithm. We also prove that it is NP-hard to approximate within a factor of 0.5584. Finally, we show in our empirical study that GIST outperforms state-of-the-art benchmarks for a single-shot data sampling task on ImageNet.
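To make the MDMS objective in the abstract above concrete, here is a minimal Python sketch that evaluates f(S) = g(S) + λ·div(S) on a toy instance and finds the best subset by exhaustive search. It is not the GIST algorithm. The coverage utility, the Euclidean metric, the toy data, the helper names, and the convention that div(S) = 0 when S has fewer than two points are all assumptions made for illustration.

```python
from itertools import combinations
from math import dist


def coverage(selected, covers):
    """Assumed utility g(S): number of distinct items covered (monotone submodular)."""
    covered = set()
    for p in selected:
        covered |= covers[p]
    return len(covered)


def diversity(selected, points):
    """div(S): smallest pairwise distance in S.

    Returning 0.0 for |S| < 2 is a convention chosen for this sketch only.
    """
    if len(selected) < 2:
        return 0.0
    return min(dist(points[u], points[v]) for u, v in combinations(selected, 2))


def mdms_objective(selected, points, covers, lam):
    """f(S) = g(S) + lam * div(S)."""
    return coverage(selected, covers) + lam * diversity(selected, points)


def brute_force_mdms(points, covers, lam, k):
    """Exhaustive search over all subsets with |S| <= k (exponential time)."""
    best_set, best_val = (), float("-inf")
    for size in range(k + 1):
        for subset in combinations(sorted(points), size):
            val = mdms_objective(subset, points, covers, lam)
            if val > best_val:
                best_set, best_val = subset, val
    return best_set, best_val


# Hypothetical toy instance: four points in the plane, each covering some items.
points = {"a": (0, 0), "b": (1, 0), "c": (4, 0), "d": (4, 3)}
covers = {"a": {1, 2}, "b": {2, 3}, "c": {3}, "d": {4, 5}}

best_set, best_val = brute_force_mdms(points, covers, lam=1.0, k=2)
print(best_set, best_val)
```

On this toy instance the pair ("a", "d") wins because it both covers four items and is the farthest-apart pair; exhaustive search is only feasible for very small inputs, which is why an approximation algorithm is of interest.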

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)


How does feature learning reshape the function space?

Lobo, João, Loureiro, Bruno, Tran-Than, Long, Liu, Fanghui

arXiv.org Machine Learning, May-19-2026

Feature learning is widely regarded as the key mechanism distinguishing neural networks from fixed-kernel methods, yet its impact on the induced function space remains poorly understood. In this work, we precisely characterize how the function space spanned by the features of a two-layer neural network evolves during gradient descent training. We prove that, in the high-dimensional proportional regime, after a large gradient step the post-update feature distribution is well approximated by a target-dependent spiked Gaussian covariance. This induces a data-adaptive kernel that reshapes the function space and modifies its spectral structure. Our analysis reveals that feature learning can be interpreted as a distributional transformation in either parameter space or input space, equivalently as the introduction of a target-dependent kernel. In particular, it selectively amplifies eigenvalues aligned with the target direction and mixes leading eigenfunctions, coupling the top radial mode with a target-aligned quadratic harmonic. Overall, our results provide a precise function-space perspective on early-stage feature learning: rather than just rescaling a fixed kernel, gradient descent induces a data-adaptive deformation that preferentially enhances directions aligned with the signal in the data.

artificial intelligence, kernel, machine learning, (18 more...)

arXiv.org Machine Learning

2605.17718

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.68)


3d36c07721a0a5a96436d6c536a132ec-Supplemental.pdf

Neural Information Processing Systems, Apr-25-2026, 13:16:09 GMT

Figure S1: Estimated Networks 1 & 3 from linear factor models of DS (Top) and Granger causality (Bottom) for simulated data experiment. Each panel shows a grid of DS or Granger causality (GC) features associated with the indicated network estimate. Within each grid, a plot corresponds to signal that is being transmitted from the channel listed on the left to the channel listed at the top. See Figure 1 for a description of the true networks. Each subplot represents the DS from the region listed on the left to the region listed on top. Power spectra are reasonable to model using a linear factor model because they satisfy Definition 1 under reasonable assumptions. We will use $S_{cc}(\omega)$ to refer to the spectral power of the signal $v_c(t)$ at frequency $\omega$, and $v_c(\omega)$ to refer to the frequency domain representation of $v_c(t)$ at $\omega$.

artificial intelligence, directed spectrum, machine learning, (15 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Therapeutic Area > Neurology (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)


3d36c07721a0a5a96436d6c536a132ec-Paper.pdf

Neural Information Processing Systems, Apr-25-2026, 13:16:06 GMT

artificial intelligence, brain network, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Data Science (0.68)


Overcoming the Convex Barrier for Simplex Inputs: Supplementary Material

Neural Information Processing Systems, Apr-25-2026, 04:20:52 GMT

Strong mixed-integer programming formulations for trained neural networks.

artificial intelligence, machine learning, relaxation, (16 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)


Overcoming the Convex Barrier for Simplex Inputs

Neural Information Processing Systems, Apr-25-2026, 04:20:48 GMT

Recent progress in neural network verification has challenged the notion of a convex barrier, that is, an inherent weakness in the convex relaxation of the output of a neural network. Specifically, there now exists a tight relaxation for verifying the robustness of a neural network to $\ell_\infty$ input perturbations, as well as efficient primal and dual solvers for the relaxation. Buoyed by this success, we consider the problem of developing similar techniques for verifying robustness to input perturbations within the probability simplex. We prove a somewhat surprising result that, in this case, not only can one design a tight relaxation that overcomes the convex barrier, but the size of the relaxation remains linear in the number of neurons, thereby leading to simpler and more efficient algorithms. We establish the scalability of our overall approach via the specification of $\ell_1$ robustness for CIFAR-10 and MNIST classification, where our approach improves the state-of-the-art verified accuracy by up to 14.4%. Furthermore, we establish its accuracy on a novel and highly challenging task of verifying the robustness of a multi-modal (text and image) classifier to arbitrary changes in its textual input.

artificial intelligence, machine learning, relaxation, (18 more...)

Neural Information Processing Systems

Industry: Information Technology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)


Linear regression without correspondence

Neural Information Processing Systems, Mar-17-2026, 17:16:00 GMT

This article considers algorithmic and statistical aspects of linear regression when the correspondence between the covariates and the responses is unknown. First, a fully polynomial-time approximation scheme is given for the natural least squares optimization problem in any constant dimension. Next, in an average-case and noise-free setting where the responses exactly correspond to a linear function of i.i.d.
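As a rough illustration of the least squares problem described in the abstract above, the following Python sketch handles the one-dimensional, no-intercept case by trying every assignment of covariates to responses and fitting the slope in closed form for each. This is plain exhaustive search in factorial time, not the approximation scheme the article refers to. The function name, the restriction to a single slope parameter, and the toy data are assumptions for illustration.

```python
from itertools import permutations


def fit_unlabeled_1d(xs, ys):
    """Minimize sum_i (ys[i] - w * xs[perm[i]])**2 over slopes w and permutations perm.

    Returns (w, perm, sse). Exhaustive search: only usable for very small n.
    """
    sxx = sum(x * x for x in xs)
    if sxx == 0:
        raise ValueError("covariates must not all be zero")
    best = None
    for perm in permutations(range(len(xs))):
        # For a fixed pairing, the optimal slope is the usual least squares ratio.
        sxy = sum(xs[j] * y for j, y in zip(perm, ys))
        w = sxy / sxx
        sse = sum((y - w * xs[j]) ** 2 for j, y in zip(perm, ys))
        if best is None or sse < best[2]:
            best = (w, perm, sse)
    return best


# Hypothetical noise-free data: responses are 2 * x, listed in shuffled order.
xs = [1.0, 2.0, 3.0]
ys = [6.0, 2.0, 4.0]

w, perm, sse = fit_unlabeled_1d(xs, ys)
print(w, perm, sse)
```

Here `perm[i]` is the index of the covariate paired with response `ys[i]`. In this noise-free example the search recovers both the slope and the pairing exactly, and the factorial cost of enumerating pairings shows why more efficient algorithms are needed.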

artificial intelligence, machine learning, neural information processing system 30, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.69)


